Essential Pentaho ETL by Gowda Aryan Kavan & Gowda Aryan Kavan
Author:Gowda, Aryan Kavan & Gowda, Aryan Kavan
Language: eng
Format: epub
Published: 2020-12-24T00:00:00+00:00
Start the Pentaho server
Action Plan
Configure your PDI client (Spoon) to PDI server.
Configure the KETTLE_HOME directory.
Chapter 4: Dealing with Data
In this chapter youâll learn about the following:
Reading files using PDI Spoon
Reading tables using PDI Spoon
Reading REST API data using PDI Spoon
The PDI ETL tool has several steps for dealing with different formats of data. The Pentaho PDI Spoon has several steps grouped by the category i.e., input, output, transformation, streaming, statistics, big data, scripting, data warehouse, bulk loading steps that allow you to read, write and transform all the structured, semi-structured and non-structured data. In this chapter, you will learn not only the basics for reading and writing data, but also all the how-toâs for dealing with them.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Distributed Machine Learning with Python by Guanhua Wang(3603)
Getting Started with CockroachDB by Kishen Das Kondabagilu Rajanna(2576)
Exploratory Data Analysis with Python Cookbook by Ayodele Oluleye(1417)
Getting Started With CockroachDB: A Guide to Using a Modern, Cloud-Native, and Distributed SQL Database for Your Data-Intensive Apps by Kishen Das Kondabagilu. Rajanna(1235)
R Web Scraping Quick Start Guide by Olgun Aydin(1082)
PostgreSQL 13 Cookbook: Over 120 recipes to build high-performance and fault-tolerant PostgreSQL database solutions by Vallarapu Naga Avinash Kumar(1016)
Mastering PostgreSQL 15 - Fifth Edition by Hans-Jürgen Schönig(689)
Apache Hadoop 3 Quick Start Guide by Hrishikesh Karambelkar(450)
Pandas for Everyone: Python Data Analysis, 2nd Edition by Daniel Y. Chen(446)
Learn SQL with MySQL: Retrieve and Manipulate Data Using SQL Commands with Ease by Ashwin Pajankar(406)
SQL Query Design Patterns and Best Practices by Steve Hughes & Dennis Neer & Dr. Ram Babu Singh & Shabbir H. Mala & Leslie Andrews & Chi Zhang(391)
Deploy Node.js on GCP: A comprehensive guide to deploying Node.js on Google Cloud Platform by Jonathan Lin(377)
Configuring Sales and Distribution in SAP ERP by Unknown(360)
Leveling Up with SQL by Mark Simon(336)
Learning Data Science by Sam Lau(325)
Intermediate Python by Oswald Campesato(321)
The Definitive Guide to Data Integration by Pierre-Yves BONNEFOY Emeric CHAIZE Raphaël MANSUY Mehdi TAZI(303)
Data Engineering with AWS: A Comprehensive Guide to Building Robust Data Pipelines by Paul Brian(296)
Pandas Basics by Oswald Campesato(294)
